Histograms for OLAP and Data-Stream Queries
Author
Abstract
Histograms are an important tool for data reduction in both data-stream querying and OLAP, since they allow us to represent large amounts of data in a very compact structure on which both efficient mining techniques and OLAP queries can be executed. Data reduction can yield significant time- and memory-cost advantages, but the trade-off with accuracy has to be managed in order to obtain considerable improvements in the overall capabilities of mining and OLAP tools. In this chapter we focus on histograms, which the recent literature has shown to be one possible concrete answer to the above requirements.
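To make the size/accuracy trade-off concrete, here is a minimal Python sketch of answering a range-sum query from a histogram instead of from the raw data. It is an illustration only, not the chapter's method: the equal-width bucketing, the uniform-spread assumption inside each bucket, the function names build_histogram and estimate_range_sum, and the sample data are all assumptions introduced here.

```python
from typing import List, Tuple

# Each bucket is (start, end, total): it summarises data[start:end] by its sum.
Bucket = Tuple[int, int, float]


def build_histogram(data: List[float], num_buckets: int) -> List[Bucket]:
    """Split positions 0..len(data)-1 into equal-width buckets (hypothetical scheme)."""
    n = len(data)
    buckets: List[Bucket] = []
    for b in range(num_buckets):
        start = b * n // num_buckets
        end = (b + 1) * n // num_buckets  # exclusive upper bound
        if start < end:
            buckets.append((start, end, sum(data[start:end])))
    return buckets


def estimate_range_sum(buckets: List[Bucket], lo: int, hi: int) -> float:
    """Estimate sum(data[lo:hi]), assuming values are spread uniformly in each bucket."""
    total = 0.0
    for start, end, bucket_sum in buckets:
        overlap = min(hi, end) - max(lo, start)
        if overlap > 0:
            total += bucket_sum * overlap / (end - start)
    return total


# Made-up sample data: 12 values reduced to 3 buckets.
data = [3, 5, 4, 4, 20, 22, 21, 19, 1, 0, 2, 1]
hist = build_histogram(data, 3)

exact = sum(data[2:7])                    # answer computed on the raw data
approx = estimate_range_sum(hist, 2, 7)   # answer computed on the histogram only
print(exact, approx)                      # prints: 71 69.5
```

In this sketch, twelve values are reduced to three bucket sums, and the query over positions 2 to 6 is answered with an error of 1.5. Queries aligned with bucket boundaries are answered exactly, while queries that cut through a bucket rely on the uniformity assumption, which is where the accuracy loss comes from.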
Similar resources
Optimal Histograms for Hierarchical Range Queries (Extended Abstract)
Now there is tremendous interest in data warehousing and OLAP applications. OLAP applications typically view data as having multiple logical dimensions (e.g., product, location) with natural hierarchies defined on each dimension, and analyze the behavior of various measure attributes (e.g., sales, volume) in terms of the dimensions. OLAP queries typically involve hierarchical selections on some ...
Approximate Range Queries by Histograms in OLAP
Online analytical processing applications typically analyze a large amount of data by means of repetitive queries involving aggregate measures on such data. In fast OLAP applications, it is often advantageous to provide approximate answers to queries in order to achieve very high performance. A way to obtain this goal is by submitting queries on compressed data in place of the original data. H...
Where is Business Intelligence taking today's Database Systems?
The invention of technology made Business Intelligence (BI) possible over relational engines, but now the experience of putting them into production has unearthed a new set of problems in need of further invention. Over the past few years, academia has provided very performant and storage-efficient technologies for fundamental BI objects: cubes (Dwarf, Quotient Cube), instigated resear...
What Can Hierarchies Do for Data Streams?
Much effort has been put into building data stream management systems for querying data streams. Here, data streams have been viewed as a flow of low-level data items, e.g., sensor readings or IP packet data. Stream query languages have mostly been SQL-based, with the STREAM and TelegraphCQ languages as examples. However, there has been little work on supporting OLAP-like queries that provide ...
Scalable real-time OLAP on cloud architectures
In contrast to queries for on-line transaction processing (OLTP) systems that typically access only a small portion of a database, OLAP queries may need to aggregate large portions of a database which often leads to performance issues. In this paper we introduce CR-OLAP, a scalable Cloud based Real-time OLAP system based on a new distributed index structure for OLAP, the distributed PDCR tree. ...